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ABSTRACT 

This paper presents MoLUSC, a new method for generating mock galaxy catalogs from a large scale 
(« 1000 3 Mpc 3 ) dark matter simulation, that requires only modest CPU time and memory allocation. 
The method uses a small-scale (~ 256 3 Mpc 3 ) dark matter simulation on which the GallCS semi- 
analytic code has been run in order to define the transformation from dark matter density to galaxy 
density transformation using a probabilistic treatment. MoLUSC is then applied to a large-scale dark 
matter simulation in order to produce a realistic distribution of galaxies and their associated spectra. 
This permits the fast generation of large-scale mock surveys using relatively low-resolution simulations. 
We describe various tests which have been conducted to validate the method, and demonstrate a first 
application to generate a mock Sloan Digital Sky Survey redshift survey. 
Subject headings: large scale structures - galaxy - cosmology - SDSS 



1. INTRODUCTION 

Cosmological simulations are of great importance for 
studies of the formation of large scale structure in the 
universe. They permit us to objectively test cosmolog- 
ical models against observations, and in the past have 
helped to rule out models. However, analysis of these 
surveys is difficult due to the biases and limitations that 
are inherent in any observational survey of the galaxy 
distribution in the universe. In order to quantify those 
biases and to make detailed comparisons to theoretical 
models, we need to generate realistic mock galaxy sur- 
veys. This is particularly difficult for very large surveys 
such as the Sloan Digital Sky Survey (SDSS) which sur- 
vey a substantial volume of the universe. 

A common way to generate mock catalogs is to carry 
out simulations which model a volume which is at least 
as large as the survey in question, while at the same 
time resolving the smallest dark matter halos that can 
host galaxies of interest (e.g., Jing et al 1998; Yan et al. 
2003). Once halos are identified in such a simulation, 
it can be populated using techniques such as the Halo 
Occupation Distribution (Peacock & Smith 2000; Zhao 
et al 2002). However, for the largest surveys which can 
span thousands of Mpc, it is often prohibitively expensive 
to run such simulations, both in terms of memory and 
cpu. 

Instead, we describe a method which does not iden- 
tify individual halos, but instead uses a smaller-scale, 
but higher-resolution simulation to constrain the relation 
between dark-matter density and galaxy density. The 
higher-resolution simulation does resolve all the relevant 
dark matter halos and so it can use a more accurate 
method to populate the halos. In this paper, we use the 
GallCS semi-analytic model on this smaller-scale simu- 
lation, but in principal any technique to associate halos 
with galaxies could be used. Our method is similar in 
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spirit to Cole et al (1998) and Hamana et al (2002) but 
our method of computing the effective bias is more so- 
phisticated (as it is based on a full-blown semi-analytic 
method). 

The large scale (~ 1000 3 Mpc 3 ) cosmological dark 
matter (DM) simulations presented in this paper wer e 
run using the GADGET-2 public code ([Springel 2005f ). 
GADGET-2 is a massively parallel code for hydrodynam- 
ical cosmological simulations, although we use only its 
dark-matter capabilities. 

The MoLUSC treatment, i.e. converting DM partic- 
ules into a distribution with galaxy properties was made 
using the GALI CS public mock g alaxy catalogs. The 
GallCS project (|Hatton et al. 20031 ) describes hierarchi- 
cal galaxy formation with the so-called "hybrid" ap- 
proach. It uses the outputs of large cosmological N-body 
simulations to get a more realistic description of dark 
matter halos, and a semi-analytic model to describe the 
baryons. Because it keeps a record of the spatial and dy- 
namical information, the hybrid approach opens the way 
to a detailed treatment of galaxy interaction and merg- 
ing. The GallCS model explicitly intends to address the 
issue of the high-redshift star formation rate history in 
a multi- wavelength prospect, from the ultraviolet to the 
sub-millimeter range. 

2. GENERATING GALAXIES FROM DARK MATTER 
SIMULATIONS 

In the following, we make the assumption that the 
galaxy spatial distribution is mainly affected by the un- 
derlying dark matter distribution and that all other pro- 
cesses influencing it can be considered as stochastic and 
have no significant impact on the large scale clustering 
properties Let Si be a large scale dark-matter only simu- 
lation, S s a small scale high resolution dark-matter only 
simulation and G s the resulting galaxy distribution ob- 
tained by using GallCS on S s . The process used by Mo- 
LUSC to generate a galaxy distribution out of Si 
consists of two main steps: 

1. The computation of the bias between dark mat- 
ter and galaxy distribution in S s and G s . This is 
achieved by: 
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(a) Sampling S s and G s density fields over iden- 
tical grids (ps a ( r i) an d PG a ( r i) hereafter). 

(b) Computing the probability 
P(pG a (*i)\ps a (*i)) that for a given grid 
node z, a dark matter density ps s (^i) and a 
galaxy density pG s ( Y i) would be measured. 

(c) Computing the probability that, for a given 
value of pG s { v ) an d ps s (r), a given spectrum 
would be assigned to a galaxy located in r. 

2. The generation of a galaxy distribution Gj that 
respects the probability distributions computed in 
the first step while following the large scale struc- 
ture distribution of Si. This result is obtained by: 

(a) Sampling the density field of Si over a grid 
(ps t (Ti) hereafter). 

(b) Building a density field p G *(ri) from 
PSi( T i) and the probability distribution 

P(PG.(n)\ps.(n))- 

(c) Creating a discrete galaxy distribution whose 
sampled density field is pG*(*i) 

2.1. Bias analysis 

As explained above, the dark matter to galaxy distri- 
bution bias is computed from the sampled density fields 
of S s and G s . There exist many efficient ways for sam- 
pling a density field from a discrete point distribution. 
In this case, we want to keep track of which galaxy con- 
tributed to which sampling grid node in order to be able 
to recover the spectral information. The chosen method 
uses a truncated Gaussian kernel and consists of consid- 
ering the i th particle as a density cloud W(r — r^) cen- 
tered on the particle location r^. For a cubic sampling 
grid with cell size ah -1 Mpc, the number density at the 
k th node is then given by: 



N 



n(r k ) = ^2 W ( T k ~ r»), 



and its mass density is: 



N 



(1) 



(2) 



i=0 



where is the mass of the i th particle and N the total 
number of particles. 

The kernel function is chosen to be a truncated Gaus- 
sian of the form: 



W(r) = 



A 



(4ttL 2 ) 



3/2 



exp 



l|r||), (3) 



where A is a normalization constant, L is the smoothing 
length, II is the Heavyside function and A sets the trun- 
cation length. Given the infinite extend of the Gaussian 
function, the kernel is truncated in order to reduce the 
computation time. Experiments show that choosing a 
sampling length equal to the smoothing length <j = L 
(all information being wiped out at scales smaller than 
L) and setting the truncation length to A = 5 provides 



good results, the error on the measurement compared 
to an infinite extend kernel being of order 10 -7 for a 
homogeneous field. 

Once the sampled density fields ps s (r$) and pG s (r;) are 
computed from S s and G s , the probability P (no \ps ) for 
the galaxy number density at a location r to be nc(r) 
knowing that the dark matter mass density at this same 
location is ps(*) can be obtained. This is achieved by 
applying the following equations: 

N n 

P (n G \p s ) oc y^ 5(p s (r k ) - p s )S(n G (r k ) - n G ) 



f 
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k=i 



P(n G \p s ) dn G = 1 



(4) 

where the sum is computed over the N n nodes of the sam- 
pling grids (which are identical for both distributions) 
and S(r) is the usual Dirac function. Figure [1] shows the 
function P (n G \ps) for three different redshifts (z = 3, 
z = 1 and z = 0) computed from a 512 3 particles in a 
100/z -1 Mpc box dark matter simulation and its GallCS 
counterpart. As expected, two different regimes exist: 
when dark matter density is low, no galaxy formation 
can occur whereas for high densities, the galaxy number 
density is proportional to the dark matter mass density 
(n Gs = bps s ). Between these two regimes, a large range 
of galaxy densities can correspond to a given dark mat- 
ter density depending on the g alax y form ation history. 
At high redshifts (Figures 1(a) [ and l(b)[ ), a change can 
be observed for P (n G \ps) at high ps values. This can 
be explained by the fact that for z = 3, the most mas- 
sive halos gravitational collapse has not occurred yet. 
Hence, the halos mass function is dominated by small 
size objects which cannot all be detected within the sim- 
ulation due to resolution limitations. In fact, as within 
the hierarchical model framework large halos are formed 
by smaller halos fusions, their galaxy content is weaker 
than expected at high redshifts. 

The function P (n G \ps ) describes how the galaxy dis- 
tribution maps to the underlying dark matter distribu- 
tion but does not contain any information about the 
galaxy properties. In order to be able to attribute a spec- 
trum to every galaxy created from Si, we compute the 
probability distribution P (i^(A) \n G: ps ) that a given 
galaxy located at point r has a given spectrum i^(A), 
knowing that the galaxy number density is n G (r) and 
the dark matter mass density is ps(r)- This distribu- 
tion is obtained from G s and S s , using the N synthetic 
spectra i^(A) generated by GallCS in G s : 

P(F,(A)|n G ,p 5 )^ 

^2S(p s (rj) - p s )S(n G (rj) - n G )W(ri - Tj), 

3 = 1 

N 



J2P(Fi(X)\n G ,Ps) = l. 



(5) 

In this equation, is the location of the i th galaxy in 
G S1 Tj the coordinates of the j th grid node and i^(A) the 
spectrum associated to the i th galaxy. This simply states 
that the probability for a given type of galaxy to exist at 
a place where the galaxy and dark matter densities are 
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Fig. 1. — The probability P (no \ps ) that a given galaxy num- 
ber density no corresponds to a dark matter mass density ps for 
different redshifts (logarithmic scales). For low dark matter densi- 
ties, no galaxy formation occur and the galaxy number density is 
thus whereas it appears directly proportional to the dark mat- 
ter density ps for high values of ps- In between this two regimes, 
a wide range of galaxy densities can correspond to a given dark 
matter density. See main text for an explanation of the change of 
behavior with redshift. 



no and ps is proportional to the number of galaxies of 
that type observed in G s close to grid nodes, each being 
weighted by their respective contribution to the value of 
no at that node (hence the factor W(ti — r^)). From a 
more practical point of view, a list of spectra is attributed 
to every pixel in Figure[T]with a weight associated to each 
of them. 

2.2. Generating the galaxy distribution 

The first step of the process consisted of extracting 
information from S s and G s through the compu- 
tation of probability distributions P(ug\ps) and 
P (Fi(X) | no, Ps )• The second step aims to build a 
galaxy distribution G^ out of the large scale dark matter 
simulation Si using P(ug\ps) and P (i*i(A) \ug, Ps )• 
To do so, we first compute the sampled galaxy number 
density field nG*( ri ) corresponding to Si. Using the 
same technique and grid parameters as before, the 
sampled mass density field ps^Vi) is extracted and, for 
every grid node z, a corresponding value of nG*( ri ) is 
randomly selected, following the probability distribution 
P(n G \ps)- 

The galaxy distribution G* can then be generated by 
creating a point distribution whose sampled density field 
is Gf . Let 



N 



||W(r)||=Xy(r- ri ) 



(6) 



be the sampled norm of the kernel W, with i the index 
of a grid node. The fact that we used a truncated Gaus- 
sian kernel implies that ||W(r)|| is not exactly constant 
but rather a-periodic. It is nonetheless possible to show 
empirically that when L < <j, 



max r (HW(r)|l) -min r (HW(r) 
min r (||W(r)||) 



< 1%, 



(7) 



where min r (/(r)) and max r (/(r)) are the minimal and 
maximal values of /(r). So as long as L < a, it is possi- 
ble to consider that ||W(r)|| is a constant and the total 
number of galaxies Ng* 



in is: 



N a 



N g being the total number of grid nodes. 



(8) 



Following the hypothesis that, on the scale at which 
we are working (i.e of order lh~ x Mpc), the galaxy dis- 
tribution should follow the dark matter distribution, the 
galaxy distribution is generated by changing dark mat- 
ter particles from Si into galaxies or removing them fol- 
lowing some criteria. This way, the galaxy distribution 
is assured to follow the large scale distribution of dark 
matter. Knowing the value of the galaxy number density 
field G*, the probability Qi for a dark matter particle to 
be transformed into a galaxy can be expressed as: 



Qi oc 



(9) 



ps(n) ' 

which can be normalized using the fact that a total of 



4 



Sousbie, Courtois, Bryan & Devriendt 



Nq* should be created: 



Qi = N G * 



nc^y 
Ps(*i) . 



Jri Ps(n) 



(10) 



where i corresponds to the index of one of the Ns dark 
matter particles in Si. For every dark matter particle, 
located at position r^, the local galaxy and dark matter 
densities nci^i) and ps(^i) are linearly interpolated from 
the values at surrounding grid nodes, and every particle 
is changed into a galaxy or rejected with a probability 
Qi. The attribution of a spectrum to every generated 
galaxy follows the same process: if a galaxy is created 
at a location where the galaxy and dark matter density 
are tlq and ps, then one of the N s spectra Fi(X) in G s 
is attributed to it, following the probability distribution 
P (Fi(X) |n<3, ps)- In the unlikely case where Qi > 1 
(which means that the local galaxy number density is 
superior to the dark matter number density), a number of 
galaxies equal to the integer part of Qi are generated and 
randomly located on a sphere of radius d centered on r^, d 
being a random number with a probability distribution 
W{d) (i.e. identical to the kernel used for the density 
sampling). 

3. VERIFYING THE GALAXY DISTRIBUTION 

Using the previously described process, one can gen- 
erate a large scale galaxy distribution from a large scale 
dark-matter-only simulation by mimicking the properties 
of a smaller scale galaxy distribution generated using so- 
phisticated semi- analytical models such as GallCS. To 
check the quality of the results, we generated a galaxy 
distribution G* from a 100/i _1 Mpc with 256 3 particles 
simulation S using the properties of the galaxy distri- 
bution G gener ated with GallCS from the same simu- 
lation 4 . Figure 2(a)| shows a 40/z -1 Mpc slice of G and 



|2(b) a 40h 1 Mpc slice of G*. In the two cases, the 
total number of galaxies is very similar: 30, 765 with 
GallCS as opposed to 30,941 for MoLUSC. Moreover, 
the scheme makes the distribution very similar on scales 
larger than the smoothing length of l/i _1 Mpc (halos, fila- 
ments and voids are located at the same place). Nonethe- 
less, galaxy clusters generated by MoLUSC clearly ap- 
pear more spread out than the ones generated with Gal- 
lCS. This is due to the Gaussian smoothing that is part of 
MoLUSC process. The other difference in cluster appear- 
ance is their shape but constitute this time a strength of 
MoLUSC. GallCS uses a spherical collapse approxima- 
tion for halos such that galaxies are distributed com- 
pletely spherically within a halo, but this is not neces- 
sarily the case with MoLUSC which does not require any 
halo identification. This results in galaxy clusters that 
really follow the underlying matter distribution, thus pre- 
senting different geometries. 

The examination of the two point correlation functions 
£(r) in Figure [3] confirms the preceding remarks. Mo- 
LUSC mainly aims to reproduce GallCS galaxy distri- 
bution. The comparison of the correlation functions for 
G (red curve) and the one for G* (black curve) indi- 
cates that they are similar on scales larger than 2h~ x 
Mpc while some differences exist on smaller scales. It 

4 available from http://www.galics.iap.fr 



is clear that below scales of order the smoothing length 
(1/z -1 Mpc here), correlations are weaker with MoLUSC 
in which case they tend to follow the dark matter correla- 
tion function (green curve). Between scales of order the 
smoothing length and 2h~ x Mpc, the opposite situation 
appear: whereas correlations are lacking in G because 
of the spherical halo model used by Galics, this is not 
the case in G*. All these observations tend to show that 
MoLUSC is particularly appropriate for the production 
of large scale distributions of galaxy out of a large scale 
dark matter-only simulation. 

4. REAL SPACE SNAPSHOT TO REDSHIFT CATALOG 
CONVERSION 

Once the simulation has been populated with galaxies, 
the primary difference remaining between a simulation 
and a survey like SDSS is that observing galaxies intro- 
duce biases that are not present in a simulation. So now 
that we have obtained a galaxy distribution, there is still 
to make a mock catalog emulating these biases. First 
of all, when galaxy properties are measured with a tele- 
scope, one often only measures their redshift and their 
apparent magnitude in a given filter. All other properties 
like the absolute magnitude or distance has to be com- 
puted from these values. This of course introduce errors 
and limitations, the main ones being that: 

1. The spectrum of a galaxy is not constant with 
wavelength. So as it is redshifted because of the 
expansion of the universe, the apparent magnitude 
is not measured in the same wavelength range as 
the absolute magnitude. This can be corrected by 
using so called K-corrections, the value of which 
depends on galaxy morphology, redshift and filter 
characteristics. 

2. The measured redshift is used to compute a dis- 
tance assuming it is only due to the expansion of 
the universe. The peculiar velocities are neglected 
in the process and give rise to so-called redshift 
distortions and the "finger of god" effect. 

3. The geometry of a survey is constrained by prac- 
tical matters, giving rise to complex geometry as 
seen with SDSS on fig El 

Some aspects of the p rocedure used are q uite similar to 
the one described in (|Blaizot et al. 20 05). The reader 
can refer to this paper for further details on some points. 

To build a more realistic mock catalog, we can also 
take into account incompleteness with distance, the zone 
of obscuration and one could imagine to add a surface 
brightness selection since the absolute luminosity is avail- 
able from the simulations. 

4.1. Basic equations 

For a given cosmological model, the comoving distance 
between a galaxy at redshift z and an observer in a flat 
universe can be defined as follows: 



D(z) 



c 

Hi 



o Jo 



dz 



V<Wl+*) 3 + fy/(20' 



Q m + — 1 

(11) 

where Q m and are the fraction of matter and dark 
energy respectively. The function f(z) depends on the 
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Fig. 3. — Two-point correlation function £(r) of the dark matter 
distribution (green curve) and of the galaxies distributions gen- 
erated by GallCS (red curve) and MoLUSC (black curve). The 
smoothing length used for MoLUSC is L = l/i -1 Mpc. Part of 
the particles in the dark matter simulation have been randomly- 
removed so that the number of points is the same. 



cosmological model. If we consider the part to be 
modeled by a fluid with an equation of state p = wp 
then we obtain 



/(*) = (! + *) 



3(l+w) 



p = wp 



(12) 



which gives f(z) = 1 in the case of a LCDM model. The 
distance (fTTj) is not directly observable, but corresponds 
to the distance than one can measure directly between 
two galaxies in a simulation box. It can be easily related 
to luminosity distance by oIl = (1 + z)D where is the 
luminosity distance. 

Although Equation [TT] is not analytically solvable in 
the general case, a low redshift approximation can be 
used for our purpose, giving at second order in z: 



D(z) = -- 

^0 



Qo) z : 



(13) 
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with q the deceleration parameter: 



<?0 



+ (1 + 3w) . 



(14) 



Finally the classic relationship between apparent and ab- 
solute magnitude is given by: 

M = m - 5 log (D(z)(l + z)) - 25 + K(z) (15) 

where D(z)(l + z) = d,L is the luminosity distance, M 
the absolute magnitude, m the apparent magnitude and 
K(z) stands for the K-correction term. 

4.2. Mock catalog construction 

One of the major problems of studies of large-scale 
structure is that it is quite difficult to compare theoret- 
ical results to observational data in a fair way. With 
MoLUSC, one can generate a large scale galaxy distribu- 
tion from a dark matter only simulation. Galaxies, being 
generated from an N-body simulation, are distributed in 
a series of data cubes representing universe snapshots 
at different redshifts. A galaxy catalog such as SDSS 
is different in that it represents the galaxy distribution 
as viewed by an observer. Thus, making mock catalogs 
consists of reproducing from the data cubes a galaxy dis- 
tribution similar to what would be observed, taking into 
account all the observational biases. 

Initially, every snapshot contains the following infor- 
mation: 

• The value of the redshift. 

• The location and velocities of all galaxies. 

• For every galaxy, a corresponding spectrum. 

By randomly placing an observer within the z = snap- 
shot, and defining a line of sight direction, it is possible 
to compute the observational properties of the galaxies 
belonging to the volume of the galaxy catalog to mimic, 
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paying attention to the fact that the galaxies should be 
extracted from the snapshot with redshift correspond- 
ing to the distance between the observer and the galaxy. 
The main problem with this method is that the original 
simulation can be smaller in size than the galaxy catalog 
for which a mock catalog is being generated. Of course, 
one benefit of the MoLUSC technique is that very large 
galaxy distributions can be easily generated, but still it 
can be that every snapshot has to be duplicated in or- 
der to obtain a distribution on a large enough volume. 
This technique is called random tiling and consists of 
stacking simulation snapshots after applying to each of 
them random rotations and translations, using the peri- 
odic boundaries condition to ensure that the transformed 
snapshots have the same geometry. The random trans- 
formations are very important in that they prevent the 
final distribution from being periodic as anyway no infor- 
mation exists on scales larger than that of the snapshots. 

Figure [H illustrates the random tilling method. The 
main drawback comes from the fact that a large cluster 
can be cut if it is located at a place where the snapshot 
used to build the mock catalog changes. This problem is 
solved using a friends-of- friends type structure finder: in- 
stead of computing observational properties of the galax- 
ies one by one, it is done for every cluster and depending 
on the cluster center position, all of its galaxies will be 
present or absent of the mock catalogs. The influence of 
rand om tilling on the correlation function can be found 
in (|Blaizot et al. 20051 ). section 3. 

4.3. Final catalogs 

Galaxy catalogs often have complex geometries due to 
observational constraints. It is important to reproduce 
them in order to be able to check the influence of edge 
effects. To do so, the method we use simply consists of 
building a mask from the real catalog and applying it 
to the mock catalog. Once the catalog is built, the only 
step left involves reproducing the main observational bi- 
ases: redshift distortions and spectral redshifting. Red- 
shift distortions are taken into account by using the pe- 
culiar velocities of the galaxies obtained from the simu- 
lation snapshots. To a galaxy at a given distance we add 
a redshift due to hubble flow to an additional term due 
to the peculiar velocity that prevents an exact distance 
measurement. The amplitude of this term is simply given 
by: 



and 



Sz 




1 



(16) 



where v p is the norm of the peculiar velocity projection 
on the line of sight. Finally, the galaxies' absolute and 
apparent magnitudes are computed using GallCS syn- 
thetic spectra as observed through a given filter F(X). 
This way, the i th galaxy with redshift Zi + Szi with a rest 
frame spectrum Pi (A) has an observer frame spectrum: 



if bs (A) 



Pi(A(l + ^)) 

1 + Zi ' 



(17) 



and its absolute and apparent magnitudes Mi{F) and 
rrii(F) are: 



Mi 



rrij 
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P° bs (\)F(\(l + Zi ))d\. 



(19) 



Figure [5] shows the result obtained for a mock SDSS 
DR4 catalog using MoLUSC with different observational 
biases taken into account. 

5. CONCLUSIONS 

In this paper we presented MoLUSC, a tool for extract- 
ing mock galaxy surveys out of large scale pure dark mat- 
ter numerical simulations. Our method consists of first 
computing numerically at a given redshift how the galaxy 
distribution maps to the underlying dark matter distri- 
bution using a high resolution small scale simulation. We 
are then able to reproduce the galaxy distribution char- 
acteristics from a snapshot of any size and resolution, 
while conserving proper statistical properties. 

Galaxy properties are computed as measured by an ob- 
server. Bias effects such as fingers of god, k-corrections 
on magnitudes, catalog geometry are numerically com- 
puted allowing a fair comparison to the observed data 
which is crucial in order to study the formation and evo- 
lution of the large-scale structures in the Universe. 

MoLUSC is being used to provide mock catalogs of 
the SDSS and of the Deep Fields of CFHTLS, for a va- 
riety of cosmologies. The observers are provided with 
mock catalogs matching the survey selection effects such 
as magnitude depth and survey geometry. The virtual 
observations are provided in the corresponding survey 
set of filters. 

This tool provides a realistic galaxy distribution out 
of any large dark matter only simulation without us- 
ing powerful computer hardware, allowing a faster as- 
trophysical analysis of the present large DM simulations. 

This work was supported by the US Space Inter- 
ferometer Mission Dynamics of Nearby Galaxies (SIM- 
DOG) Key Project, by a european Marie Curie pre- 
doctoral gra nt, and by the French Horizon Project 
( http: / / projet-horizon.fr[ ) . Greg Bryan acknowledges 
support from NSF grants AST-0507161, AST-0547823, 
and AST-0606959. 
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Fig. 4. — Illustration of the random tiling method and of the construction of mock catalogs from the initial catalogs. The space is made 
from initial catalogs with different redshifts which have been randomly transformed (rotation, translation). The red dots show the same 
cluster of galaxies. The red circle illustrates the problem of clusters lying close to a side of a box. To create a mock catalog, only the 
galaxies included in the geometry of the initial catalog (here in green) are selected. 
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(c) MOCK-NB (d) MOCK-NBNF 



Fig. 5. — Comparison between the observed SDSS DR4 (a) survey (limited to 350 Mpc) and 3 different virtual SDSS surveys (b, c, 
d) built from the large scale low resolution DM simulation. The virtual catalog MOCK (b) is made taking into account all the observed 
selection effect and observational artifacts as the fingers of god. The MOCK-NB (c) is built while neglecting the bias between galaxies and 
DM. This results in a less contrasted density field. In MOCK-NBNF (d) both the bias and the redshift distortions were not accounted for. 
As a result of this lack of observational artifacts the galaxy clusters appear almost sphericals, not elongated along the line of sight. 



